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POLYMERASE SIGNALING ASSAY 



1. FIELD OF THE INVENTION 
5 The present invention relates to the field of nucleic 

acid sequence analysis. Specifically, the invention provides 
methods of nucleic acid sequence analysis which use 
combinatorial sequence array primers to sequence and/or 
detect mutations or polymorphisms within a template nucleic 
10 acid. 

2 . BACKGROUND OF THE INVENTION 
This invention relates to the field of nucleic acid 
sequence analysis. The analysis of nucleic acid sequences 

15 can be used, e.g., to determine the presence or absence of a 
particular genetic element. Variant genetic elements of a 
nucleic acid sequence usually exist. Exemplary variant 
genetic elements may include, but are by no means limited to, 
genetic mutations or polymorphisms such as single nucleotide 

20 polymorphisms ("SNP's"), base deletions, base insertions, and 
heterozygous as well as homozygous polymorphisms. 
Accordingly many techniques have been developed to compare 
homologous segments of nucleic acid sequence to determine if 
the segments are identical or if they differ at one or more 

25 nucleotides. Practical applications of these techniques 
include genetic disease diagnoses, infectious disease 
diagnoses, forensic techniques, paternity determinations, and 
genome mapping. 

In general, the detection of nucleic acids in a sample 

3 0 and of the subtypes thereof depends on the technique of 
specific nucleic acid hybridization in which the 
oligonucleotide probe is annealed under conditions of high 
stringency to nucleic acids in the sample, and the 
successfully annealed probes are subsequently detected (see, 

35 e.g., Spiegelman, S., 1964, Scientific American 210:48). 

The most definitive method for comparing DNA segments is 
to determine the complete nucleotide sequence of each 
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segment. Examples of how sequencing has been used to study 
mutations in human genes are included in the publications of 
Engelke et al . (1988, Proc. Natl. Acad. Sci . U.S.A. 85:544- 
548) and Wong et al . (1987, Nature 330:384-386). The most 
5 commonly used methods of nucleic acid sequencing include the 
dideoxy-mediated chain termination method, also known as the 
"Sanger Method" (Sanger, F. et al . , 1975, J. Molec. Biol. 
94:441; Porbe, J. et al . , 1987, Science 238:336-340) and the 
chemical degradation or "Maxam-Gilbert" method (Maxam, A.M. 

10 et al., 1977, Proc. Natl. Acad. Sci. U.S.A. 74:560). 

Both the Sanger and Maxim-Gilbert methods comprise a 
series of four chemical reactions, one for each of the 
nucleotide bases, e.g., A, C, G, and T for DNA, consisting of 
either primer extension (Sanger) or partial cleavage (Maxim- 

15 Gilbert) reactions. The reactions produce four sets of 

nested nucleic acid molecules whose lengths are determined by 
the location of a particular base along the length of the 
nucleic acid molecule being sequenced. The nested reaction 
products are then resolved by electrophoretic gels. 

2 0 The separation and analysis of reaction products on 

electrophoretic gels is a laborious and time consuming step. 
Accordingly, alternative methods have been developed to 
sequence nucleic acid molecules. For example, there is 
considerable interest in developing methods of de novo 

25 sequencing using solid phase arrays (see, e.g., Chetverin, A. 
B. et al., 1994, Bio /Technology 12:1093-1099; Macevicz, U.S. 
Patent No. 5,002,867; Beattie, W.G. et al . , 1995, Molec. 
Biotech. 4:213-225; Drmanac, R.T., EP 797683; Gruber, L.S., 
EP 787183; each of which is incorporated herein by reference 

30 in its entirety) . These methods consist primarily of 

hybridization of template nucleic acids to arrayed primers 
containing combinatorial sequences which hybridize to 
complementary sequences on the template strand. The methods 
combine the capture of the template, by formation of stable 

35 duplex structures, with sequence discrimination due to 
instability of mismatches between the template and the 
primer. 
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Such methods must typically employ arrays of primers at 
least twelve bases in length which contain approximately 16 
million sequence combinations. Such arrays are very complex 
and time consuming both to construct and to analyze. Thus, 
5 at the present time it is not practical to use extensive 
sequencing methods, such as the methods described above, to 
compare more than just a few DNA segments because the effort 
required to determine, interpret, and compare complete 
sequence information is time-consuming. 

10 Restriction fragment length polymorphism ( "RFLP" ) 

mapping is another commonly used screen for DNA polymorphisms 
arising from DNA sequence variation. RFLP consists of 
digesting DNA with restriction endonucleases and analyzing 
the resulting fragment by means of Southern blots, as 

15 described by Botstein et al . , 1980 (Am. J. Hum. Genet. 
32:314-331) and White et al . (1988, Sci . Am. 258:40-48). 
Mutations that affect the recognition of sequence of the 
endonuclease will preclude enzymatic cleavage at that site, 
thereby altering the cleavage pattern of the DNA. DNAs are 

20 compared by looking for differences in restriction fragment 
lengths. However, a major problem with RFLP mapping is its 
inability to detect mutations that do not affect cleavage 
with a restriction endonuclease. Thus, many mutations are 
missed with this method. Further, the methods used to detect 

25 restriction fragment length polymorphisms are very labor 

intensive, particularly the techniques involved with Southern 
blot analysis. 

Alternative, simpler methods have been developed which 
use solid phase arrays to analyze single nucleotide 

30 polymorphisms (SNP's). These techniques rely on the fact 
that analysis of SNP's, which constitute sites of variation 
flanked by regions of invariant sequence, requires no more 
than the determination of the identity of the single 
nucleotide present at the site of variation. 

35 For example, several primer-guided nucleotide 

incorporation procedures for assaying polymorphic sites in 
DNA have been described (e.g., Komher, J.S. et al . , 1989, 
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Nucl. Acids Res. 177779-7784; Sokolov, B.P., 1990, Nucl . 
Acids Res. 18:3671; Syvanen, A.-C. et al . , 1990, Genomics 
8:684-692; Kuppuswamy, M.N. et al . , 1991, Proc . Natl. Acad. 
Sci. U.S.A. 88:1143-1147; Prezant, T.R. et al . , 1992, Hum. 
5 Mutat. 2:159-164; Ugozzoli, L. et al . , 1992, GATA 9:107-112; 
Nyren, P. et al . , 1993, Anal. Biochem. 208:171-175; and 
Wallace W089/ 10414) . Each of these methods relies on the 
incorporation of labeled deoxynucleotides to discriminate 
between bases at a polymorphic site. An alternative 

10 microsequencing method, the Genetic Bit Analysis (GBA™) 

method has been disclosed by Goelet, P. et al . (WO 92/15712) 
which avoids many of the problems in the above identified 
microsequencing assays. In GBA rM , the nucleotide sequence 
information surrounding a predetermined site of interrogation 

15 is used to design an oligonucleotide primer that is 

complementary to the region immediately adjacent to, but not 
including, the predetermined site. The target DNA template 
is selected from the biological sample and hybridized to the 
interrogating primer. This primer is extended by a single 

20 labeled dideoxynucleotide using DNA polymerase in the 

presence of at least two, and most preferably all four chain 
terminating nucleoside triphosphate precursors. 

Several variations of the GBA method have been 
developed, as well as other microsequencing methods (see, 

25 e.g., Mundy, U.S. Patent No. 4,656,127; Vary and Diamond, 

U.S. Patent No. 4,851,331; Cohen, D. et al . , PCT Application 
No. WO91/02087; Chee, M. et al . , W095/11995; Landegren, U. et 
al . , 1988, Science 242:1077-1080; Nicerson, D.A. et al . , 
1990, Proc. Natl. Acad. Sci. U.S.A. 87:8923-8927; Pastinen, 

30 T. et al. (1997, Genome Res. 7:606-614; Pastinen, T. et al . , 
1996, Clin. Chem. 42 :1391-1397 ; Jalanko, A. et al . (1992, 
Clin. Chem. 38:39-43; Shumaker, J.M. et al . , 1996, Hum. 
Mutation 7:346-354; Caskey, C. et al . , WO 95/00669). 
Although they are simpler to perform and analyze than de novo 

35 sequencing, such microsequencing methods require primers that 
hybridize to the target nucleic acid molecule at a site 
immediately adjacent to a polymorphism (or a site suspected 
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of being next to a polymorphism) . Hence, such techniques 
require prior knowledge of a "wild type" nucleic acid 
sequence. Further, the techniques are limited to identifying 
a specific mutation or polymorphisms, typically a SNP, at a 
5 specific location in a specific nucleic acid sequence. 
Finally, such techniques also typically require multiple 
interrogations per target base. 

3 . SUMMARY OF THE INVENTION 
10 The present invention relates to a unique universal 

array system and its use for nucleic acid sequence analysis. 
The array system of the invention can be used for nucleic 
acid sequence analysis, for example to identify DNA and other 
nucleic acid sequences as well as mutations and polymorphisms 
15 in those sequences. 

The principles of the sequence analysis system and 
methods of the invention involve: (i) capture of the 
polynucleotide to be analyzed, referred to herein as the 
"template" by a specific "capture moiety" at a particular 

2 0 "capture region" on the template, (ii) scanning of the 

template by a combinatorial primer-polymerase complex for 
regions of complementarity, and (iii) signal generation 
through a template homology dependent primer extension 
reaction wherein the primer is extended by one or more 
25 nucleotides or nucleotide analogues. 

Because primer extension occurs only when a region of 
complementarity to the primer is encountered on the template, 
the extension signal generated in step (iii) is the key 
element that identifies the existence of a specific nucleic 

3 0 acid sequence in the template polynucleotide. If no sequence 

complementary to the combinatorial primer is present in the 
template, primer extension does not occur, and an extension 
signal is not generated. Thus, by performing the methods of 
the invention with an array comprising a plurality of 
35 combinatorial primers, a unique pattern or matrix of extended 
vs. non-extended primer signals is generated for the 
polynucleotide sequence being analyzed. 
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In a preferred embodiment, the array system of the 
invention comprises an array of sequence reagents arrayed on 
a solid support, and comprising (a) a capture moiety which 
can form a stable complex with a region of a template, (b) a 
5 spacer region, and (c) a primer region comprising 3-7 bases 
which, in the presence of a polymerase, can recognize 
complementary sequences in the template and can be extended 
by one or more nucleotides moieties (i.e., nucleotides or 
nucleotide analogues) by a template dependent primer 

10 extension reaction. Most preferably, the invention is 

practiced using a universal sequencing array, as describe by 
Head, S. et al . (U.S. Patent Application Serial No. 
08/976,427, filed November 21, 1997). 

Preferably, the nucleotide moieties used for template 

15 dependent primer extension are labeled nucleotides. In 
various embodiments, the nucleotide moieties may be, e.g., 
chain terminating nucleotides such as dideoxynucleotides , or 
nucleotides that are not chain terminating such as 
deoxynucleotides . Preferably, the nucleotide moieties 

2 0 comprise nucleotides or nucleotide analogues for all four of 
the nucleotide bases, A, G, C, and T. 

The invention is based, in part, on Applicants* 
discovery that nucleic acid sequences can be analyzed on 
solid phase arrays, referred to herein as sequence arrays, 

25 through the novel analysis of "on/off" polymerase extension 
signals on the arrays. The methods do not require that the 
actual identity of the polymerase extension product be 
determined, merely the determination of whether a polymerase 
extension product is or is not present for each element of 

30 the array. Subsequences of the template nucleic acid 

molecule can then be identified through simple analysis of 
these on/off polymerase extension signals, and mutations and 
polymorphisms in the template nucleic acid molecule are 
thereby detected . 

35 The present invention overcomes the limitations in the 

prior art by providing methods for the analysis of nucleic 
acid sequences which employ arrays of primer sequences that 
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are three to seven bases in length. Such primer arrays have 
no more than 4096 possible sequence combinations (for six 
bases) , and can successfully analyze polynucleotides of up to 
several thousand bases. In one preferred embodiment, such an 
5 array will have only 1024 sequence combinations (for five 
bases) , and can analyze the sequence of a polynucleotide 
having up to about 1000 bases. In another preferred 
embodiment, such an array will have only 256 sequence 
combinations (for four bases) , and can analyze sequences of 

10 up to about 3 00 bases. In certain other embodiments, it is 
possible to use arrays having even fewer sequence 
combinations. Such simple arrays are significantly easier to 
manufacture and screen than the solid phase arrays typically 
used for de novo sequencing techniques, which comprise 

15 millions of sequence combinations and/or employ sequences 
derived empirically from the target sequence. 

The methods of the present invention also provide a 
method for analyzing nucleic acid sequences wherein a 
"binary" polymerase extension signal is generated for each 

2 0 primer in the array. Thus, signal analysis simply comprises 

detecting the presence or absence of a polymerase extension 
product. Neither identification of the extension product, nor 
discrimination between the different extension products is 
required. 

25 Signal analysis is further simplified in the present 

invention in that analysis of the resulting signals simply 
comprises analyzing the binary signal pattern or "matrix" of 
the primer array. The pattern is then used to determine the 
presence or absence of a mutation, a polymorphism, or the 

3 0 sequence of a target nucleic acid molecule. Thus, the 

methods of the invention utilize a pattern matching approach 
to sequence analysis, thereby reducing or even eliminating 
the need for complicated or intensive methods of analysis. 
The present invention further improves upon the prior 
35 art in that no prior knowledge of the nucleic acid sequence 
to be analyzed is required to practice the methods of the 
invention. The methods of the invention can also be used to 
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detect any mutation or polymorphism at any location in a 
nucleic acid molecule, and are not limited to assaying for a 
particular mutation or SNP. Thus, the methods of the present 
invention are not limited to merely determining the identity 
5 of a nucleotide base at a specific position in a nucleic acid 
molecule. Examples of mutations and polymorphisms which can 
be detected by the present invention include, but are by no 
means limited to, single nucleotide polymorphisms (SNP's), as 
well as base deletions, base insertions, and heterozygous 

10 mutations and polymorphisms. 

The solid phase arrays used by the methods of the 
invention are simple to produce, and analysis of the 
resulting on/off polymerase extension signals is simple and 
straightforward, reducing the need for complicated, intensive 

15 methods of analysis. Indeed, the PSA methods of the 

invention are ideally suited for simple pattern matching 
methods of analysis. 

4 . BRIEF DESCRIPTION OF THE DRAWINGS 
2 0 FIG. 1 is a diagrammatic representation of the structure 

of the sequence reagent used in the methods of the present 
invention. 

FIGS. 2A-C illustrate the steps of the polymerase 
signaling assay (PSA) of the present invention; FIG. 2A 

25 illustrates the "capture" step wherein the capture region of 
the template nucleic acid molecule forms a stable complex 
with the capture moiety of a sequence reagent; FIG. 2B 
illustrates the "scan" step wherein the primer region of the 
sequence reagent recognizes a complementary region of the 

30 template nucleic acid molecule; FIG. 2C illustrates the 

"signal" step wherein the primer is extended by one or more 
nucleotides . 

FIGS. 3A-D illustrates the expected (FIG. 2A, FIG. 2C) 
and experimental (FIG. 3B, FIG. 3D) data signals from PSA 
35 analysis of the "wild type" and "mutant" nucleic acid 

sequences shown in Table 1, using a sequence array comprising 
the 21 primer sequences listed in Table 2. 
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FIGS. 4A-C illustrates data signal patterns predicted 
for PSA analysis of p53 exon 8; FIG. 4A illustrates the PSA 
data signal pattern for a wild type p53 template; FIG. 4B 
illustrates the PSA data signal pattern of a p53 template 
5 having a single nucleotide polymorphism (SNP; 38C -» T) ; FIG. 
4C illustrates the difference between the two data signal 
patterns shown in FIGS. 4A-B. 

FIGS. 5A-C illustrates predicted data signal patterns 
for PSA analysis of p53 exon 8; FIG. 5A illustrates the data 
10 signal pattern of a wild type p53 template; FIG. 5B 

illustrates the data signal pattern of a p53 template having 
the SNP 38C - A; FIG. 5C illustrates the difference between 
the two data signal patterns shown in FIGS. 5A-B. 

FIGS. 6A-C illustrates predicted data signal patterns 
15 for PSA analysis of p53 exon 8; FIG. 6A illustrates the data 
signal pattern obtained for a wild type p53 template; FIG. 
6B illustrates the data signal pattern of a p53 template 
having a the heterozygous polymorphism 38C -»■ A; FIG. 6C 
illustrates the difference between the two data signal 
2 0 patterns shown in FIGS. 6A-B. 

FIGS. 7A-C illustrates predicted data signal patterns 
for PSA analysis of p53 exon 8; FIG. 7A illustrates the data 
signal pattern of a wild type p53 template; FIG. 7B 
illustrates the data signal pattern for PSA analysis of a p53 
25 template having a five base deletion; FIG. 7C illustrates 
the difference between the two data signal patterns shown in 
FIGS. 7A-B. 

FIGS. 8A-C illustrates predicted data signal patterns 
for PSA analysis of p53 exon 8; FIG. 8A illustrates the data 
30 signal pattern obtained for a wild type p53 template; FIG. 
8B illustrates the data signal pattern for PSA analysis of a 
p53 template having a five base insertion; FIG. 8C 
illustrates the difference between the two data signal 
patterns shown in FIGS. 6A-B. 

35 

5. DETAILED DESCRIPTION OF THE INVENTION 
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The present invention includes methods of an assay 
referred to herein as the polymerase signaling assay or PSA. 
These methods are used for analyzing the sequence of nucleic 
acid molecules. A nucleic acid molecule which is sequence 
5 analyzed by the methods of this invention is defined herein 
as a template nucleic acid molecule, or template molecule, or 
template. Template nucleic acid molecules which may be 
sequence analyzed by the methods of the invention include DNA 
molecules, such as, but by no means limited to, genomic DNA 
10 molecules, cDNA molecules, and fragments thereof. Template 
nucleic acid molecules which may be sequence analyzed by the 
methods of the invention also include RNA molecules, such as, 
but by no means limited to, messenger RNA (mRNA) molecules, 
ribosomal RNA ( r RN A ) molecules, and fragments thereof. 

15 

The nucleic acid molecule to be analyzed by the PSA 
methods of this invention may be from any source. For 
example, the template nucleic acid molecule may be a 
naturally occurring nucleic acid molecule such as a genomic 

20 or extragenomic DNA molecule isolated from an organism, or an 
RNA molecule, such as an mRNA molecule, isolated from an 
organism. Alternatively, the template nucleic acid molecule 
may be synthesized, including, e.g., nucleic acid molecules 
synthesized enzymatically in vivo or in vitro, such as, for 

25 example, a cDNA molecule, or a nucleic acid molecule 

synthesized by PCR. The sample of nucleic acid molecules can 
comprise, e.g., molecules of deoxyribonucleic acid, 
ribonucleic acid, or copolymers or deoxyribonucleic acid and 
ribonucleic acid. 

30 The invention is based, in part, on the discovery by 

applicants that nucleic acid sequences can be analyzed on 
solid phase arrays, referred to herein as sequence arrays, 
through the novel analysis of "on/off" polymerase extension 
signals on the arrays. The methods do not require that the 

35 actual identity of the polymerase extension product be 

determined, merely the determination of whether a polymerase 
extension product is or is not present for each element of 

- 10 - PENY4-712720.1 



the array. Subsequences of the template nucleic acid 
molecule can then be identified through simple analysis of 
these on/off polymerase extension signals, and mutations and 
polymorphisms in the template nucleic acid molecule are 
5 thereby detected. 

The solid phase arrays used by the methods of the 
invention are simple to produce, and analysis of the 
resulting on/off polymerase extension signals is simple and 
straightforward, reducing the need for complicated, intensive 
10 methods of analysis. Indeed, the PSA methods of the 

invention are ideally suited for simple pattern matching 
methods of analysis. 

The following subsections present the methods of the 
invention in greater detail. in particular, Section 5.2 
15 first describes the sequence arrays used in the methods of 
the present invention, as well as methods for preparing such 
arrays. The methods of the polymerase signaling assay are 
then described, in detail, in Section 5.3. 

20 5.1. SEQUENCE ARRAYS 

Sequence arrays such as those used in the methods of the 
present invention are described, in detail, by Head, S. et 
al. (U.S. Application Serial No. 08/976,427, filed November 
21, 1997) , which is incorporated herein by reference in its 

25 entirety. Such sequence arrays are also discussed below, as 
they pertain specifically to the PSA methods of the present 
invention. 

The sequence arrays of the present invention comprise 
30 solid phase arrays of combinatorial oligonucleotides. Such 
arrays comprise a plurality of sequencing reagents 
immobilized, either individually or in a group, to a solid 
surface in a spatially distinct fashion. The sequence 
reagents of the sequence arrays each comprise (i) a "capture 
35 moiety", (ii) a "spacer region", and (iii) a combinatorial 
primer, or "primer region" which is also referred herein as 
the primer sequence. The methods of the present invention 
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utilize sequence arrays wherein the primer of each sequencing 
reagent preferably comprises from four to six bases which can 
recognize a complementary sequence of a template nucleic acid 
molecule. 

5 

5.1.1. THE SEQUENCE REAGENT 
The sequence reagents of the sequence arrays used in the 
PSA methods of the present invention are designed for use as 
part of a combinatorial array for primer extension-based 
10 sequence analysis of template nucleic acid molecules. A 
specific, preferred embodiment of the sequence reagent is 
illustrated in FIG. 1. The individual components of the 
sequence reagent, which are illustrated in FIG. 1, are 
described below. 

15 

Attachment Moiety : 

Under one preferred embodiment, the sequence reagent 
includes an optional "attachment moiety" (FIG. 1; AM) which 
is coupled to one terminus of the sequencing reagent, 

20 preferably the 5' -terminus, and functions to attach the 
sequence reagent to the solid surface. Preferably the 
attachment moiety specifically attaches the sequence reagent 
to the solid surface. 

In an alternative embodiment, the sequence reagent is 

25 non-specif ically attached to the solid surface. The sequence 
reagent can be non-specif ically attached to the solid 
surface, e.g., by means of a cationic agent, such as octyl- 
dimethylamine HC1 or NaCl. Alternatively, the sequence 
reagent can be non-specif ically attached to a charged 

30 surface, such as an amino modified solid surface. 

Under another preferred embodiment, the sequencing 
reagent is specifically attached to the solid surface. 
Preferably, the specific attachment of the sequence reagent 
is by means of a reversible bond. 

35 Under one preferred embodiment, the sequencing reagent 

can be specifically attached to the solid surface by means of 
a non-covalent bond. For example, a biotin or iminobiotin 
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labeled oligonucleotide may be immobilized to an avidin or 
strepavidin coated solid surface. Alternatively, a 
haptenated oligonucleotide may be immobilized to an antibody 
coated solid surface. However, it is to be understood that 
5 other ligand receptor interactions are suitable for use in 
the present invention. 

Under another preferred embodiment, the sequencing 
reagent is specifically attached to the solid surface by 
means of a covalent bond. Preferably, the covalent bond is a 

10 disulfide bond. Additional embodiments for attaching the 
sequencing reagent to the solid surface are discussed below 
with respect to the capture moiety. The various embodiments 
for immobilizing a nucleotide to a solid surface discussed 
with respect to the capture moiety, described below, can also 

15 be applied to immobilize the present sequencing nucleotides 
to a solid surface. 

Exemplary attachment moieties suitable for use in the 
present invention therefore include the incorporation of an 
amino, thiol, disulfide, biotin, etc. group at the 5'- 

2 0 terminus of the sequence reagent. This modification can be 

done either at the time the sequence reagent is synthesized, 
or after synthesis of the sequence reagent. 

Capture Moiety : 
25 The "capture moiety" (FIG. 1; CM) of the sequence 

reagent is a moiety which is capable of forming a stable 
complex with a region of the template nucleic acid molecule, 
referred to herein as the "capture region" of the template 
molecule. The capture moiety can be near either the 5' or 3' 

3 0 terminus of the sequence reagent. 

The capture moiety can be, e.g., a DNA, RNA, or PNA 
(protein nucleic acid) sequence. The nucleic acid sequence 
may also contain modified bases. For example, a RNA sequence 
may contain 2'-0-methoxy RNA, and a DNA sequence may contain 
35 5-Me-dC, pdC, pdU, or 2-amino-dA. Under another embodiment, 
the nucleic acid sequence may contain a modified backbone 
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wherein the backbone is modified with phosphorothioate, 
phosphordithioate, methylposphonate, or H-phosphonate . 

Under another preferred embodiment, the capture moiety 
may be biotin, iminobiotin, avidin, strepavidin, antibody, 
5 hapten, a receptor, a ligand, or a charged base. Receptors 
and ligands suitable for use in the capture moiety include, 
but are not limited to, protein A, protein G, the Fc portion 
of an antibody, or Fc receptor. 

The capture moiety can also form a stable complex with 
10 the capture region of the template nucleic acid molecule, 
e.g., by means of a disulfide bond, a covalent ether or 
thioether linkage via an epoxy, UV cross-linkage, a 
condensation reaction with a carbodiimide, a 

bromoacetyl/thiol linkage to a thioester, a crosslinkage with 

15 a bi-functional group, or a complex between thiol and gold. 
Bi-f unctional crosslinking reagents suitable as capture 
moieties in the present invention include, but are not 
limited to, an imidiester, N-hydroxysuccinimidyl ester, 
malemide, alkyl halide, aryl halide, alpha-haloacyl , and 

20 pyridyl disulfide. The capture moiety can also be covalently 
attached, e.g., by means of a labeling group. Labeling 
groups suitable for use in the present invention include, but 
are not limited to, amino, sulfhydryl, disulfide, phosphate, 
thiophosphate, dithiophosphate , and psoralen groups. 

25 The capture moiety can also exist as a separate molecule 

that is co-attached to the solid phase and effectively brings 
the template into close proximity to the primer region of the 
sequence reagent (FIG. 1; PR) . 

In one of the preferred embodiments, the capture moiety 

30 comprises a sequence of 8-24 cytosine bases which can 

hybridize to a sequence of 8-24 guanine bases incorporated on 
the template strand. In another preferred embodiment, the 
capture moiety comprises a specific sequence complementary to 
a PCR primer, or portion thereof, used to amplify a region of 

35 the template strand. For example, the capture moiety can 

comprise a sequence complementary to a restriction site found 
within the PCR primer. Alternatively, the PCR primer 
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hybridizes to a promoter site and thus, the capture sequence 
is the same sequence as a promoter site. PCR primer design 
is well known to those of skill in the art and it is 
appreciated that the capture sequence can be complementary to 
5 a PCR primer, to a portion thereof, or to a modification 
thereof . 

The efficiency of hybridization and the thermal 
stability of hybrids formed between the target nucleic acid 
and a short oligonucleotide probe depends strongly on the 

10 nucleotide sequence of the probe and the stringency of the 
reaction conditions (see, e.g., Conner, B.J. et al . , 1983, 
Proc. Natl. Acad. Sci . U.S.A. 80:278-282). Appropriate 
hybridization conditions for specific embodiments of the 
capture moiety and capture region will therefore be apparent 

15 to those skilled in the art, in view of the above and other 
references well known in the art. 



Spacer Region : 

The sequence reagent additionally comprises a spacer 

2 0 region (FIG. 1; SR) . Preferably, the spacer region is at 
least 10 nm in length, more preferably 10-100 nm in length. 
However, the spacer region can also be greater than 100 nm 
length. Spacer regions suitable for use in the present 
invention include, but are not limited to, DNA or RNA 

25 sequence, PNA sequence, polyethylene glycol groups, 5-nitro- 
indole groups, or other chemical spacer arms. The spacer 
region can also consist of analogues of DNA, RNA, and PNA. 
In such embodiments, the nucleic acid sequences of the spacer 
region may comprise unmodified or modified nucleotide bases, 

30 such as the modified bases describe above for the capture 
moiety. Preferably, the spacer region consists of a random 
sequence of bases. However, the spacer region can also 
consist of a pseudo-random or non-random sequence of bases. 
The spacer region is preferably designed to minimize 

35 template independent noise. Template independent noise is 
the result of signal detection independent (i.e., in the 
absence) of the template molecule. Under one embodiment, a 
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spacer region is additionally placed in between the capture 
moiety and the attachment moiety. 

Primer Region ; 

5 Finally, the sequence reagent also comprises a primer 

region (FIG. 1; PR) , also referred to herein as the primer. 
The primer region consists of specific bases which can 
recognize sequences on the template strand, and can be 
extended by a polymerase with one or more labeled 

10 nucleotides. In particular, as used in the PSA methods of 
the present invention, the primer consists of four to six 
specific bases which can recognize complementary sequences on 
the template strand. Generally, the primer will comprise a 
DNA or RNA sequence four to six of the naturally occurring 

15 nucleotide bases. Such sequences are also referred to herein 
as the primer sequences of the sequence reagent. 

The primer sequence may be, e.g., an 
oligodeoxyribonucleotide, and oligoribonucleotide, or a 
copolymer of deoxyribonucleotides and ribonucleotides. The 

2 0 primer sequence can be synthesized either enzymatically in 
vivo, enzymatically in vitro, or non-enzymatically in vitro, 
e.g., by the methods described in Section 5.1.2 below. 

The primer sequence, or alternatively the sequence 
reagent, may also be labeled with a detectable marker or 

25 label, including the detectable labels described in Section 
5.2.4, below. Preferably the detectable label of marker used 
to label the primer or sequence reagent will be different 
from the labels used to label either the extended primer, or 
the template. 

30 

5.1.2. SYNTHESIS OF A SEQUENCING REAGENT 
There are two preferred methods for making the 
sequencing reagents of a sequence array. The first method is 
to synthesize the specific oligonucleotide sequences directly 
35 on the solid-phase (in situ) in the desired pattern (i.e., in 
the desired spatially distinct fashion), as described, e.g., 
by Southern et al . (1994, Nucl . Acids Res. 22:1368-1373), by 
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Maskos et al . (1992, Nucl . Acids Res. 20:1679-1684), and by 
Pease et al . (1994, Proc . Natl. Acad. Sci. U.S.A. 92:5022- 
502 6) . The other method is to first pre-synthesize the 
oligonucleotides in an automated DNA synthesizer, such as an 
5 ABI 392, and to then attach the synthesized oligonucleotides 
onto the solid-phase at specific locations (see, e.g., 
Lamture et al . , 1994, Nucl. Acids Res. 22 : 2 12 1-2 12 5 ; and 
Smith et al . , 1994, Nucl. Acids Res. 22:5456-5465). 

In the first method, the efficiency of the coupling step 

10 of each base will affect the quality and integrity of the 
nucleic acid molecule array. This method generally yields a 
large percentage of undesired incomplete (i.e., shortened) 
sequence which can create problems in the analysis step, and 
thereby effect the integrity of the analysis. Thus, the 

15 quality and integrity of an array synthesized according to 
the first method is inversely proportional to the length of 
the nucleic acid molecule. Specifically, the synthesis of 
longer oligonucleotides results in a higher percentage of 
incomplete, shortened sequences. 

2 0 The second, more preferred method for nucleic acid array 

synthesis utilizes an automated DNA synthesizer for DNA 
synthesis. Oligonucleotides are synthesized using standard 
phosphoramidite chemistry (see, e.g., Matteucci, M.D. et al . , 
1981, J . Amer. Chem. Soc. 103:3185-3191). Preferably, a 

25 segmented synthesis strategy is used to simultaneously 
synthesize large numbers of oligonucleotides (see, e.g., 
Beattie, K.L. et al . , 1988, Appl . Biochem. Biotechnol . 
20:510-521; Beattie, K.L. et al . , 1991, Nature 352:548-549). 
The controlled chemistry of an automated DNA synthesizer 

30 allows for the synthesis of longer, higher quality DNA 

molecules than is possible with the first method. Also, the 
nucleic acid molecules synthesized according to the second 
method can be purified prior to the coupling step. 
Therefore, the quality of the nucleic acid molecule array can 

35 be expected to be much higher than the quality of the nucleic 
acid array of the first method. 
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5.1.3. IMMOBILIZATION TO A SOLID PHASE 
Several methods have been proposed as suitable for 
immobilizing an oligonucleotide to a solid support. For 
example, Holmstrom, K. et al . (1993, Anal. Biochem. 209:278- 
5 283) exploit the affinity of biotin for avidin and 

streptavidin , and immobilize biotinylated nucleic acid 
molecules to avidin/streptavidin coated supports. Another 
method requires the pre-coating of the polystyrene or glass 
solid phases with poly-L-Lys or poly L-Ly,Phe, followed by 

10 the covalent attachment of either amino- or sulfhydryl- 
modified oligonucleotides using bi-f unctional crosslinking 
reagents. Both methods require the use of modified 
oligonucleotides as well as a pretreatment of the solid phase 
(see, e.g., Running, J. A. et al . , 1990, BioTechniques 8:276- 

15 277; Newton, C.R. et al . , 1993, Nucl . Acids Res. 22:1155- 
1162) . 

Kawai, S. et al . (1993, Anal. Biochem. 209:63-69) 
describes an alternative method in which short 
oligonucleotide probes were ligated together to form 

2 0 multimers, and these were ligated into a phagemid vector. 

The oligonucleotides were immobilized onto polystyrene plates 
and fixed by UV irradiation at 254 nm. A method for the 
direct covalent attachment of short, 5 ' -phosphorylated 
primers to chemically modified polystyrene plates ("Covalink" 

25 plates, Nunc) has also been proposed by Rasmussen, S.R. et 
al. (1991, Anal. Biochem. 198 : 138-142) . The covalent bond 
between the modified oligonucleotide and the solid phase 
surface is created by a condensation reaction with a water- 
soluble carbodiimide. The Rasmussen method claims a 

30 predominantly 5 ' -attachment of the oligonucleotides via their 
5 ' -phosphates ; however, it requires the use of specially 
prepared, expensive plates. 

Maskos, U. et al . (1992, Nucl. Acids Res. 20:1679-1684) 
describes a method to synthesize oligonucleotides directly 

35 onto a glass support. According to this method, a flexible 
linker with a primary hydroxyl group is bound to the solid 
support via a glycidoxypropyl silane, wherein the primary 
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hydroxyl group serves as the starting point for the 
oligonucleotide synthesis. The disadvantages of this method 
are that the reaction is not reversible and the 
oligonucleotides leak from the solid surface during 
5 manipulation. 

Covalent disulfide bonds have been previously used to 
immobilize both proteins and oligonucleotides. For example, 
Carlsson, J. et al . (1991, Biotech. Applied Biochem. 14:114- 
120) discloses a method for the reversible immobilization of 

10 thiolated proteins and peptides to an agarose bead by means 
of a disulfide bond. In that method, the disulfide bond is 
formed between a thiol containing protein and a thiol- 
derivatized agarose bead. The reference also discloses that 
the disulfide bond is reversible in the presence of an excess 

15 of dithiothreitol . Chu, B.C.F. et al . (1988, Nucl . Acids 
Res. 16:3671-3691) discloses a method for coupling 
oligonucleotides to nucleic acids or proteins via cleavable 
disulfide bonds. Prior to the coupling reaction, the 
oligonucleotides are modified by adding a cystamine group to 

2 0 the 5* phosphate by means of a phosphoramadite bond. 
Sliwkowski, M.X. et al. (1983, Biochem. J. 209:731-739) 
discloses a method of covalent chromatography wherein 
proteins are immobilized to cysteinylsuccinimidoproyl glass 
beads through reversible disulfide bond interaction. 

25 Fahy, E. et al . (1993, Nucl. Acids Res. 21:1819-1826) 

describes the synthesis of 5 ' -bromacetyl and 5 '-thiol 
oligonucleotide derivatives and the covalent immobilization 
of these oligonucleotide derivatives via thioester bonds to 
sulfhydryl- and bromacety 1-modif ied polyacrylamide supports. 

30 The disadvantage of this method is that the covalent bond is 
not reversible. 

Anderson et al . (U.S. Serial No. 08/812,010, filed on 
March 5, 1997) describes a novel method for immobilizing 
nucleic acid molecules to a solid-phase by means of a 

35 reversible, covalent disulfied bond. In that method, a 
disulfide bond is formed between a thiol or disulfide 
containing nucleic acid molecule and a mercaptosilane coated 
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solid surface. Shi et al . (U.S. Serial No. 08/870,010) 
describes a novel method for immobilizing nucleic acid 
molecules to a solid phase by means of a covalent ether or 
thioether linkage. These simple, two-step methods have the 
5 specificity and efficiency needed to prepare DNA arrays. 

All of the above described methods can be used in the 
present invention to immobilize the seguence reagent to the 
solid support, the preferred embodiments are those disclosed 
by Anderson et al . (supra), and by Shi et al . (supra). An 

10 additional preferred embodiment for immobilizing the 

seguencing reagent is to immobilize biotinylated nucleic acid 
molecules to avidin/streptavidin coated supports as disclosed 
by Holmstrom, K. et al. (1993, Anal. Biochem. 209:278-283). 
Although any of a variety of glass or plastic solid 

15 supports can be used in accordance with the methods of the 
present invention, silicon glass is the preferred solid 
support. Preferably, the solid support is fashioned with 
array densities greater than 1000 elements/cm 2 . The support 
can also be fashioned as a bead, dipstick, test tube, pin, 

2 0 membrane, channel, capillary tube, column, or as an array of 
pins or glass fibers. Preferably, the plastic support is a 
form of polystyrene plastic. Alternatively, the solid 
support can be glass, preferably in the form of a microscope 
slide, coverslip, a capillary tube, a glass bead, or a 

2 5 channel. The solid support can also be a glass plate, a 
guarts wafer, a nylon or nitrocellulose membrane, or a 
silicon wafer. 

5.1.4. ARRAY FORMATION 
30 In the methods of the present invention, the sequence 

reagents are intended to be made into an array. As used 
herein, an array is an orderly, spatially dependent 
arrangement of sequence reagents, as in a matrix of rows and 
columns, or a spatially addressable or separable arrangement 
35 such as with coated beads. Preferably, the array is an array 
of permutations of the primer sequences, such as all possible 
3mers, 4mers, 5mers, 6mers, 7mers, or combinations thereof. 
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With an automated delivery system, such as a Hamilton 
robot or ink- jet printing method, it is possible to form a 
very complex array of oligonucleotide primers on a solid 
support, in particular an epoxysilane, mercaptosilane , or 
5 disulf idesilane coated solid support. Such methods can 

deliver nano or pico-liter size droplets with sub-millimeter 
spacing. Because the aqueous beads are extremely well 
defined, it is possible to create an array with an extremely 
high density of oligonucleotide primers. Thus, it is 

10 possible to create arrays having greater than about 10,000 

primer droplets/cm 2 . Such arrays can be assembled through the 
use of a robotic liquid dispenser, such as an ink-jet 
printing device controlled by a piezoelectric droplet 
generator, such that each nucleic acid molecule occupies a 

15 spot of more than about 10 microns, preferably more than 25 
microns in diameter, and such that each nucleic acid spot is 
spaced no closer, center to center, than the average spot 
diameter. Methods and apparatuses for dispensing small 
amounts of fluids using such ink- jet printing techniques and 

20 piezoelectric ink-jet depositions have been previously 
describe, e.g., by Wallace, D.B. et al . (U.S. Patent No. 
4,812,856) by Hayes, D.J. et al . (U.S. Patent No. 5,053,100). 

Under one embodiment, the array can be constructed using 
the method for Fodor, S.P. et al . (U.S. Patent No. 

25 5,445,934). Fodor et al . describes a method for constructing 
an array onto a solid surface wherein the surface is covered 
with a photoremovable group. Selected regions of the 
substrate surface are exposed to light so as to activate the 
selected regions. A monomer, which also contains a photo- 

30 removable group, is provided to the substrate surface to bind 
to the selected area. The process is repeated to create an 
array . 

Under another preferred embodiment, the array can be 
created by means of a "gene pen". A gene pen, as used 
35 herein, refers to a mechanical apparatus comprising a 

reservoir for a reagent solution connected to a printing tip. 
The printing tip further comprises a means for mechanically 
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controlling the solution flow. Under one embodiment, a 
multiplicity of gene pens or printing tips may be tightly 
clustered together into an array, with each tip connected to 
a separate reagent reservoir. Under another embodiment, 
5 discrete gene pens may be contained in an indexing turntable 
and printed individually. Typically, the solid surface is 
pretreated to enable covalent or non-covalent attachment of 
the reagents to the solid surface. Preferably, the printing 
tip is a porous pad. 

10 Alternatively, the array can be created with a manual 

delivery system, such as a pipetman. Because these arrays 
are created with a manual delivery system, these arrays will 
not, in general, be as complex as those created with an 
automated delivery system. Arrays created with a manual 

15 delivery system will typically be spaced, center to center, > 
2 mm apart. Depending on the delivery system employed, it is 
possible to create arrays spaced, center to center, with > 2 
mm spacing, 0.5-2 mm spacing, 50-500 /Ltm spacing, or < 50 /m 
spacing. 

20 

5.2. POLYMERASE SIGNALING ASSAY 
The present invention provides methods for analyzing the 
sequence of a template nucleic acid molecule; i.e., for the 
detection and/or identification of particular nucleic acid 

25 sequences and/or subtypes thereof in a sample. Specifically, 
the invention provides methods for an assay, referred to 
herein as the polymerase signaling assay or PSA, for 
analyzing the sequence of a template nucleic acid molecule 
using the sequence arrays described in Section 5.2, above. 

30 The template nucleic acid molecule to be analyzed by the PSA 
methods of the invention may be from any source. For 
example, the template nucleic acid molecule may comprise a 
naturally occuring nucleic acid molecule such as a genomic or 
extragenomic DNA molecule isolated from an organism, or an 

35 mRNA molecule isolated from an organism. Alternatively, the 
template nucleic acid molecule may comprise a synthesized 
nucleic acid molecule, include a nucleic acid molecule 
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synthesized enzymatically in vivo or in vitro. For example, 
the template nucleic acid molecule may comprise a cDNA 
molecule, or a nucleic acid molecule synthesized by PCR. 

The template nucleic acid molecule may additionally be 
5 labeled with a detectable label, including the detectable 
lables discribed in Section 5.3.4, below. Preferably, the 
detectable lable used to label the template nucleic acid 
molecule will be different from the label used to label the 
nucleotide or nucleotide analog for the primer extension 

10 reaction, so that the two moieties, i.e., the template 
molecule and the extended primer, can be readily 
distinguished from one another. Likewise, the detectable 
label used to label the template should preferably be 
different and distinct from any label used to label the 

15 primer sequence or the sequence reagent. 

Preferably, the template nucleic acid molecule analyzed 
by the methods of this invention is a single stranded nucleic 
acid molecule, i.e., a single stranded template nucleic acid 
molecule or single stranded template. Accordingly, in 

20 embodiments wherein the initially provided is not single 

stranded, e.g., wherein a double stranded or triple stranded 
template nucleic acid molecule is initially provided, it is 
preferableto first treat the sample containing the template 
nucleic acid molecule so that a single stranded template 

25 nucleic acid molecule is thereby provided. However, the 
presence of an additional strand or strands does not 
necessarily have an adverse affect upon the methods of the 
invention. Accordingly, in other embodiments the template 
nucleic acid molecule may comprise nonsingle-stranded, e.g., 

30 double- or triple-stranded, nucleic acid molecules. 

The template nucleic acid molecule additionally 
comprises a capture region, as described in Section 5.2, 
above, for the capture moiety. The capture region is capable 
of attaching itself to the capture moiety of the sequence 

35 reagents. The general steps of the polymerase signaling 

assay are illustrated in FIG. 2A-C. The steps are described 
below, first briefly, and then in detail. 
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First, during template capture (FIG. 2A) , the template 
nucleic acid molecule is contacted with the sequence reagents 
of the sequence array under conditions such that the capture 
region of the template molecule forms a stable complex with 
5 the capture moiety of a sequence reagent of the array. 
Second, in the template scanning (FIG. 2B) the template 
nucleic acid molecule and sequence reagent are incubated 
under conditions such that the primer region of the sequence 
reagent can identify complementary sequences, if any are 

10 present, in the template nucleic acid molecule. Finally, the 
primer is extended (FIG. 2C) by one or more additional 
nucleotides by a template dependent primer extension reaction 
mediated, e.g., by a DNA or RNA polymerase. The added 
nucleotides are detected to determine whether primer 

15 extension has occurred or not. The above methods are 

repeated for each sequence reagent of the sequence array, and 
the resultant pattern of primer extensions is then analyzed, 
e.g., by pattern comparison methods, to analyze the sequence 
of the template nucleic acid molecule. 

2 0 It will be apparent to those of skill in the art, that 

in the most preferred embodiment the steps of the invention 
are repeated by contacting a sample comprising a plurality of 
the template nucleic acid molecule to be analyzed to the 
sequence array such that a template molecule is captured by 
25 each sequence reagent of the array, and polymerase extension 
is detected simultaneously for each sequence reagent. 
However, for simplicity the methods of the invention are 
described in terms of contacting a single template nucleic 
acid molecule to a single sequence reagent of an array. The 

3 0 steps of the PSA method of the invention a discussed 

individually, and in more detail below. 



5.2.1. GENERATION OF SINGLE STRANDED TEMPLATES 
Preferably, the nucleic acid template molecules to be 
35 analyzed by the methods of the present invention will be 

provided as single stranded nucleic acid molecules. However, 
in certain embodiments template nucleic acid molecules may be 
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provided that are not, initially, single stranded. For 
example, the template nucleic acid molecules to be analyzed 
by PSA may be double stranded, or triple stranded nucleic 
acid molecules. In such instances, the presence of an 
5 additional strand does not necessarily affect the polymerase 
signal assay of the invention. Thus, the methods of the 
present invention may be practiced on either double-stranded, 
or on single stranded DNA obtained, for example, by alkali 
treatment of native, double stranded DNA. 

10 Where desired, however, any of a variety of methods can 

be used to eliminate one of the two natural strands of the 
target DNA molecule from the reaction. Single stranded DNA 
molecules may be produced using the single-stranded 
bacteriophage M13 (see, e.g., Messing, J. et al . , 1983, Meth. 

15 Enzymol . 101:20; Samrook, J. et al . , 1989, In: Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY) . 

Several alternative methods can be used to generate 
single-stranded DNA molecules. Gyllensten, U. et al . (1988, 

20 Proc. Natl. Acad. Sci . U. S .A . 85 : 7652-7656) and Mihovilovic , 
M. et al . (1989, Biotechniques 7:14) describe a method, 
termed "asymmetric PCR" , in which the standard PCR method is 
conducted using primers that are present in different 
molecular concentrations. Higuchi, R.G. et al . (1985, Nucl . 

25 Acids Res. 17:5865) exemplifies an additional method for 
generating single-stranded amplification products. The 
method entails phosphorylating the 5' -terminus of one strand 
of a double-stranded amplification product, and then 
permitting a 5 ' -+ 3 ' exonuclease to preferentially degrade 

30 the phosphorylated strand. 

Other methods have also exploited the nuclease resistant 
properties of phosphorothioate derivatives in order to 
generate single-stranded DNA molecules (see, e.g., Benkovic 
et al., U.S. Patent No. 4,521,509; Sayers, J.R. et al . , 1988, 

35 Nucl. Acids Res. 25:791-802; Eckstein, F. et al . , 1976, 
Biochemistry 15:1685-1691; and Ott, J. et al . , 1987, 
Biochemistry 26:8237-8241) . 

— 25 — PENY4-712720.1 



Most preferably, such single-stranded molecules will be 
produced using the methods described by Nikiforov, T. (U.S. 
Patent No. 5,518,900). In brief, these methods employ 
nuclease resistant nucleotide derivatives, and incorporate 
5 such derivatives, by chemical synthesis or enzymatic means, 
into primer molecules, or their extension products, in place 
of naturally occurring nucleotides. 

Suitable nucleotide derivatives include derivatives in 
which one or two of the non-bridging oxygen molecules of the 

10 phosphate moiety of a nucleotide has been replaced, e.g., 

with a sulfur-containing group such as a phosphorothioate, an 
alkyl group such as a methyl or ethyl alkyl group, a 
nitrogen-containing group such as an amine, and/or a 
selenium-containing group. Phosphorothioate 

15 deoxyribonucleotide or ribo-nucleotide derivatives are the 
most preferred nucleotide derivatives. Methods of producing 
and using such phosphorothioate derivatives are disclosed by 
Nikiforov, T. (U.S. Patent No. 5,518,900). 

20 5.2.2. TEMPLATE CAPTURE 

Once a template nucleic acid molecule and sequence array 
have been provided, the first step of the PSA methods of the 
present invention is to contact the sample containing the 
template molecule with the sequence array under conditions 

25 such that the capture region of the template nucleic acid 
molecule forms a stable complex with the capture moiety of a 
sequence reagent. In particular, the capture region of the 
template molecule must form a stable complex with the capture 
moiety of the sequence reagent so that the template molecule 

30 is immobilized to the sequence reagent of the solid-phase 
sequence array. 

The exact conditions necessary will be apparent to one 
of skill in the art for particular embodiments of the capture 
moiety and capture region. For example, in a particularly 

35 preferred embodiment, the capture moiety is a sequence of 8 
to 24 cytosine bases which hybridizes to a capture region of 
8 to 24 guanine bases incorporated in the template nucleic 
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acid molecule. In another preferred embodiment, the capture 
moiety can be a specific sequence complementary to a PCR 
primer, or portion thereof, used to amplify a region of the 
template strand. Thus, in such embodiments appropriate 
5 hybridization conditions would be conditions of high 
stringency, e.g., incubation at a final template 
concentration of 100 nM in 1.5 M NaCl at room temperature for 
1 hour, followed by multiple washings in IX TNTw (Tris pH 8, 
Tween 0.5%, 150 mM NaCl). 

10 In general, the efficiency of hybridization and the 

thermal stability of hybrids formed between such embodiments, 
wherein the capture region and capture moiety comprise a 
target nucleic acid sequence and a short oligonucleotide 
primer, respectively, depends strongly on the nucleotide 

15 sequence of the primer, and on the stringency of the reaction 
conditions as discussed, e.g., in Conner, B.J. et al . , supra. 
Appropriate hybridization conditions for specific embodiments 
will therefore be apparent to those skilled in the art in 
view of the above and other references well known in the art. 

20 

5.2.3. TEMPLATE SCANNING 
After the template nucleic acid molecule has been 
immobilized to the sequence reagent as described above, the 
template-reagent complex is incubated under conditions which 

25 allow duplex formation between the primer sequence of the 
sequence reagent and sequences of the template nucleic acid 
molecule that are complementary to the primer sequence. Such 
conditions generally comprise conditions of stringent 
hybridization, such as the conditions described above for 

30 hybridization of a target nucleic acid sequence and a short 
oligonucleotide primer or probe. 

Primer sequences as short as 3-7 bases cannot hybridize 
to a template molecule to form stable duplexes. However, 
such primers will form transient duplexes in the presence of 

35 a polymerase with complementary regions of a template so that 
template dependent extension of the primer occurs. Such 
duplexes will only form between the template nucleic acid 
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molecule and those sequence specific hybridization regions 
for which the complementary sequence is present, in one or 
more copies, in the template nucleic acid molecule. 

5 5.2.4. PRIMER EXTENSION 

Preferably, before the primer extension reaction is 
performed, the template is capped by the addition of a 
terminator to the 3' -end of the template. The terminator is 
one which is capable of terminating a template-dependent, 

10 primer extension reaction. The template is thus capped so 
that no additional nucleotides or nucleotide analogs will 
attach to the 3' -end of the template. The 3 » -end of the 
template may be capped, e.g., by a dideoxynucleotide, or by 
other chain terminating nucleotides, such as those described 

15 below, in Section 5.3.4. 

The conditions for the template-dependent primer 
extension reaction require the presence of a suitable 
template dependent enzyme, such as a DNA or RNA polymerase. 
However, the polymerase must be both primer and template 

2 0 dependent. Preferably, the polymerase is also a thermostable 
and/or exonuclease free polymerase. Exemplary DNA 
polymerases which may be used in the methods of the invention 
include, for example, E. coll DNA pol I or the "Klenow 
fragment" thereof, T4 DNA polymerase, T7 DNA polymerase 

25 (i.e., "Sequenase") , T . aquaticus DNA polymerase, or a 

retroviral reverse transcriptase. RNA polymerases, such as 
T3 or T7 RNA polymerase, can also be used in certain 
embodiments . 

The conditions for the template-dependent primer 
30 extension reaction further require the presence of any one or 
more "nucleotide moieties", i.e., one or more of the 
naturally occuring nucleotides or analogs thereto. 
Preferably, the duplexes are incubated in the presence of all 
four of the natural deoxynucleoside triphosphates (dNTP's), 
35 dATP, dCTP, dGTP, and dTTP, or in the presence of analogues 
to all four of the above dNTP's. Alternatively, one or more 
of the nucleotide moieties may comprise a ribonucleotide 
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triphosphate (rNTP) In one embodiment, the template-primer 
duplex is incubated in the presence of non-extendible or 
chain terminating nucleotides. Exemplary chain terminating 
nucleotides include 2 ' , 3 • -dideoxynucleotide triphosphate 
5 derivatives of adenine, thymine cytosine, and guanine. 

In addition to dideoxynucleotides, 3" phosphate modified 
oligonucleotides which are complementary to a template 
nucleic acid molecule and effectively block DNA 
polymerization, can be used in the present invention (see, 
10 e.g., Romberg et al . , 1992, In: DNA Replication, 2nd 
Edition, Kornberg et al . , eds. , W.H. Freeman & Co., San 
Francisco, pp. 408, 446-449) . Alternatively, a nucleotide 
analog, such as a fructose based nucleotide analog or a 
chemically modified purine or pyrimidine that retains the 
15 ability to specifically base pair with naturally occurring 
nucleotides may be used to block DNA polymerization. A 
variety of 3 ' -substituted nucleotides ( see, e.g., Antrazhev, 
1987, Bioorg. Khim 13:1045-52; Chidgeavadze, Z.G. et al . , 
1986, Biochemi. Biophys . Acta 868:145-152; Chidzhacadze et 
20 al . , 1989, Mol . Biol. (Mosk.) 23:1732-1742) such as azido- 
(Mitsuya et al . , 1986, Proc . Natl. Acad. Sci . U.S.A. 
83:1191), mercapto- (Yuzhakov et al . , 1992, FEBS Letters 
306:185-188), amino- (Herrein et al . , 1994, Helvetica Chimica 
Acta 77:586-596), and fluoro- (Chidgeavadze, Z.G. et al . , 
25 1985, FEBS Letters 183:275-278) substituted nucleotides, 

which have been reported to terminate DNA synthesis, can be 
used in the present invention. 

The nucleotide or nucleotide analog can be detectably 
labeled, preferably with a fluorescent molecule or haptenated 
3 0 deoxy- or dideoxynucleotide. Alternatively, the nucleotide 
can be detected, e.g., using delayed extraction MALDI-TOF 
mass spectrometer (see, e.g., Haff, L.A. et al . , 1997, Genome 
Methods 7:378-388). MALDI-TOF mass spectrometry is capable 
of determining the identity of the incorporated non- 
35 extendible nucleotide by the change in mass of the extended 
primer . 
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The use of f luorescently labeled nucleotides and 
nucleotide analogues in more preferable. Other labels 
suitable for use in the present invention include, but are 
not limited to, biotin, iminobiotin, antigens, cof actors, 
5 dinitrophenol, lipoic acid, olefinic compounds, detectable 
polypeptides, electron rich molecules, enzymes capable of 
generating a detectable signal, and radioactive isotopes. 
The preferred radioactive isotopes include 32 P, 35 S, 14 C, and 
125 I. Fluorescent molecules suitable for the present invention 

10 include, but are not limited to, fluorescein, rhodamine, 

texas red, FAM, JOE, TAMRA, ROX, HEX, TET, Cy3 , Cy3.5, Cy5, 
Cy5.5, IRD40, IRD41, and BODIPY. As used herein, "FAM" 
refers to 5 'carboxy-f luorescein, "JOE" refers to 2 , ,7»- 
dimethoxy-4 • , 5 ' -dichloro-6-carboxy-f luorescein , "TAMRA" 

15 refers to N , N , N ' , N ' -tetramethy 1-6-carboxy-rhodamine , "ROX" 
refers to 6-carboxy-X-rhdoamine. Electron rich indicator 
molecules suitable for the present invention include, but are 
not limited to, ferritin, hemocyanin, and colloidal gold. 
Alternatively, the polypeptide may be indirectly detected by 

20 specifically complexing a first group to the polypeptide. A 
second group, covalently linked to an indicator molecule, 
which has affinity for the first group could be used to 
detect the polypeptide. In such an embodiment, compounds 
suitable for use as a first group include, but are not 

25 limited to, avidin and strepavidin. Compounds suitable for 
use as a second group include, but are not limited to, biotin 
and iminobiotin. 

The exact conditions necessary for the template- 
dependent primer extension reaction will be determined by the 

30 specific polymerase used for the reaction, as well as by the 
choice of nucleotide or nucleotide analogues. These 
conditions for specific embodiments of the primer extension 
reaction will be apparent to those of skill in the art. 

In one preferred embodiment, the template nucleic acid 

35 molecule is separated from the sequence reagent after the 

extension reaction. This embodiment thus allows the template 
to be recovered, e.g., for further analysis. The template 
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molecule is separated from the sequence reagent by means of 
appropriate denaturing conditions that are well known to 
those of skill in the art; e.g., incubating the template- 
reagent complex with heat, alkali, formamide, urea, glyoxal, 
5 enzymes, and combinations thereof. 

5.2.5. DETECTION AND ANALYSIS 
Once polymerase extension products have been permitted 
to form, the extension product is detected for each sequence 

10 reagent. For embodiments wherein labeled nucleotides or 
nucleotide analogues are used in the primer extension 
reaction, signal detection consists of simply detecting the 
labeled nucleotide. 

For example, in embodiments wherein fluorescent ly 

15 labeled nucleotides or nucleotide analogues are used, signal 
detection is accomplished simply by detecting a fluorescent 
signal at the wavelength emitted by the fluorophore. In 
other embodiments, wherein the nucleotide or nucleotide 
analogues are labeled by means of radioactive isotopes, e.g., 

20 32 P or 35 S labeled dNTP's or ddNTP's, extension products may be 
detected by using autoradiography to detect the radioactive 
base. In yet other embodiments which use chemical labels 
such as biotin, the labeled nucleotide or nucleotide analog 
may be detected, e.g. , by means of a fluorescent probe or dye 

25 such as RPE Protein Dye (Molecular Probe) . 

The results of signal detection for an array of sequence 
reagents, each of which comprises a unique primer sequence, 
can be represented as a digital matrix or pattern of binary 
"on/off" values for each sequence reagent of the sequence 

30 array wherein either an extension product is detect ("on"- 
value) or no extension product is detected ("off-value) . 
Each binary signal of the resulting matrix thus indicates 
that are particular primer sequence is ("on") or is not 
("off") present in the template nucleic acid molecule. The 

35 nucleotide sequence of the template molecule is thereby 
represented as a unique digital pattern. 
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Such digital data can readily be stored as binary 
patterns in a computer readable format and loaded into 
computer memory for further manipulation or analysis. The 
binary patterns can be readily analyzed by simply comparing 
them, e.g., to the known binary patterns of other wild type 
of mutation nucleotide sequences. Such comparisons can be 
made simply by the manual inspection of such patterns by a 
user, or by means of a computer algorithm using the binary 
pattern read into computer memory. 

Known binary patterns can be readily obtained, e.g., by 

applying the above methods of PSA to a template nucleic acid 
molecule consisting of a known wild type or mutation 

sequence. Alternatively, the binary pattern of a known wild 

type or mutation sequence can be predicted from its nucleic 

acid sequence. 

Sequences of approximately 3 00-500 bases can be analyzed 
by the above methods using arrays of 1024 unique sequence 
reagent comprising primer sequences of 5 bases with a high 
probability of detecting and characterizing all possible 
mutations within the sequence. Nucleotide sequences of about 
100 bases can be analyzed using 256 element arrays having 
primer sequences four bases in length. Arrays of comprising 
primer sequences six bases in length are appropriate for 
characterizing template nucleic acid molecules of about 500- 
1,500 bases. 

6 . EXAMPLES 

The examples presented here illustrate the concept and 
practice of the polymerase signaling assay methods of the 

0 invention. Specifically, the examples first demonstrate the 
application of the methods of the invention to comparatively 
analyzing the sequences of a "wild type" and "mutant" DNA 
oligonucleotide sequence. The examples demonstrate, second, 
the use of the polymerase signaling assay methods of the 

5 invention to analyze template nucleic acid molecules 

corresponding to an exon of the p53 gene, including template 
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nucleic acid molecules corresponding to mutant and 
polymorphic sequences of that gene. 

The examples are presented by way of illustration of the 
previously described invention, and are not limiting of that 
5 description in any way. 

6.1. EXAMPLE It USE OF PSA TO IDENTIF Y SINGLE BASE 
SUBSTITUTIONS IN A SYNTH FTTC TEMPLATE 

This example analyzes two synthetic DNA templates that 

vary by a single base using the Polymerase Signaling Assay 

10 

(PSA) method of the present invention on a solid-phase 
sequence array. The results of this example demonstrate that 
the PSA methods of the present invention produce different 
patterns of signals for the two different template sequences 
as predicted. 

6.1.1. MATERIALS AND METHODS 

Template Synthesis : 

DNA oligonucleotide sequences which are complementary to 
the "wild type" and "mutant" sequences listed in Table I were 
synthesized using standard techniques of phosphoramidite 
chemistry (see, e.g., Matteucci, M.D. et al . , supra) as 
templates for PSA analysis. The oligonucleotides were 
synthesized to further contain a 5'-(C) 18 tail to provide a 
capture region for attachment of the template to the sequence 
reagents . 



20 



25 



30 



Primer Synthesis ; 

The 21 primer sequences shown in Table II were 
synthesized using the same phosphoramidite chemistry 
techniques as were used for template synthesis, describe 
above. The primers were synthesized to additionally contain 
a 5' -(G) , 8 -(N) 7 tail of 2 ' -O-Me RNA nucleotides which 
functioned as a capture, and spacer region for the sequence 
35 reagent. A thiol group was incorporated at the 5' -terminus 
of the sequence reagent to function as an attachment moiety 
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to secure the reagent to the glass surface of the sequence 
array . 

Array Formation : 
5 The 21 sequence reagents were printed in a five by five 

array on glass slides, according to the methods described in 
Section 5.2.3 and 5.2.4, above. 

Polymerase Signal Assay ; 

10 Each template was attached to the sequence array by 

incubation at a final concentration of 100 nM in a 1.5 M NaCl 
solution at room temperature for 1 hour. Slides were washed 
three times in IX TNTw (Tris pH 8, Tween 0.5%, NaCl 150 mM) 
followed by incubation with an extension mix (described in 

15 Head, S. et al . , 1997, supra) in which all four ddNTP ■ s were 
labeled with Biotin. After 30 minutes at room temperature, 
the extension reaction was terminated by washing the slides 
three times in IX TNTw. 3 00 m of RPE Protein Dye (Molecular 
Probes) was incubated on each array for 3 0 minutes, the 

2 0 arrays were washed and scanned on a Hitachi FMBIO. 

6.1.2. RESULTS 
The purpose of this experiment was to analyze two 
template DNA molecules that vary by a single nucleotide base 

25 using the PSA methods of the present invention. To this end, 
two template deoxyribo-oligonucleotide sequences were 
synthesized having the nucleotide sequences listed in Table 
I. For convenience, the two sequences are referred to herein 
as the "wild type" and "mutant" sequences, respectively. The 

30 two sequences differed by only one nucleotide base; 

specifically by the substitution of a guanine for cytosine at 
position 8 of the wild type sequence shown in Table I. This 
single nucleotide polymorphisms is indicated in Table I by 
the underlined base in each of the two sequences. The two 

35 template sequence also had 5 , -(C) 18 tails to function as the 
capture region for attachment to the capture moiety of the 
sequence reagent. 
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TABLE I 



5 



wild type sequence 


5 ' -GTCTCTCCCAGGACAGGCACA- 3 ' 


mutant sequence 


5 ' -GTCTCTCGCAGGACAGGCACA- 3 ' 



A sequence array was constructed for sequence analysis 
of each of the two templates according to the methods 
described in Section 5.2, above. The sequence reagents used 

10 in the array had the schematic structure 

S • S- (G) 18 - (N) 7 -NXXXXX 
wherein a disulfide bond (S«S) functioned as the attachment 
moiety, a (G) 1% base sequence functioned as the capture 
moiety, and a string of seven mixed bases ( (N) 7 ) functioned 

15 as the spacer region. 2 1 -0-Me RNA bases were used for the 
nucleotides of both the capture moiety and the spacer region. 
The primer sequence of each sequence reagent comprised a 
specific base sequence of four nucleotides which functioned 
as primers to the templates during the assay. The sequence 

20 reagents were printed in a five by five array on glass 

slides, according to the methods described in Section 5.2, 
above . 

Twenty-one primer sequences were used in the assay which 
had signal sequences found in both the wild type and mutant 

25 templates. Their sequences are listed in Table II, below, 
along with the coordinates of their location in the sequence 
array. Table II further indicates whether primer extension 
is predicted for each of the 21 primer sequences, i.e., 
whether the five base primer sequence is present in either 

30 the wild type or mutant template sequence. 



35 



- 35 - 



PENY4-712720.1 



TABLE II 



5 



15 



20 



25 



Primer 
Location 


Signal 
Sequence 


Expect Signal 
from Wild Type 


Expect Signal 
from Mutant 


A2 


CCTGC 


NO 


YES 


A3 


TGTGC 


YES 


YES 


A4 


GTGCC 


YES 


YES 


Bl 


TGCCT 


YES 


YES 


B2 


GCCTG 


YES 


YES 


B3 


CCTGT 


YES 


YES 


B4 


CTGCG 


NO 


YES 


B5 


CTGTC 


YES 


YES 


CI 


TGTCC 


NO 


YES 


C2 


CGAGA 


YES 


YES 


C3 


GTCCT 


YES 


YES 


C4 


TCCTG 


YES 


YES 


C5 


CCTGG 


YES 


NO 


Dl 


CTGGG 


YES 


NO 


D2 


TGGGA 


YES 


YES 


D3 


TGCGA 


NO 


YES 


D4 


GGGAG 


YES 


NO 


D5 


GGAGA 


YES 


NO 


E2 


GAGAG 


YES 


YES 


E3 


AGAGA 


YES 


YES 


E4 


GCGAG 


NO 


YES 



The expected data signals for the assay are illustrated 
graphically in FIG. 3A for the wild type template molecule, 
and in FIG. 3C for the mutant template molecule. 

The resultant experimental data patterns from PSA 
analysis of the template molecules are shown in FIGS. 3B and 
3D. FIG. 3B shows the resultant digital matrix from PSA 
analysis of the wild type sequence, while FIG 3D shows the 
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digital matrix obtained from PSA analysis of the mutant 
sequence. The two arrays show distinct signal patterns 
reflective of the sequence of the two different templates. 

5 6 ' 2 ' EXAMPLE 2: USE OF PSA TO IDENTIFY MUTATIONS AND 

POLYMORPHISMS IN A P53 GENE 

This example analyzes the 103 base pair sequence of p53 

exon 8 using the PSA methods of the present invention on a 

solid-phase sequence array. The results of this example 

1Q demonstrate that the PSA methods of the invention can 

identify several varieties of genetic mutations and/or 

polymorphisms of an actual gene, including single nucleotide 

polymorphisms, base deletions, base insertions, and 

heterozygote polymorphisms. 

15 

The purpose of this experiment was to determine the 
potential applicability of the PSA methods of the invention 
for analyzing template nucleic acid molecules corresponding 
to actual genetic sequences. To this end, the PSA methods 

20 were used to analyze wild type and mutant sequences of the 
106 base exon 8 of the p53 gene (SEQ ID NO. — ) . 

Accordingly, using a computer model, expected binary 
data was generated for PSA analysis of every possible point 
mutation of the p53 exon 8 nucleic acid sequences, thereby 

25 generating data for 3x106=318 different nucleic acid 

sequences. The predicted digital matrices are illustrated in 
FIGS. 4A-6C for certain, specific sequences. 

The results show that about 97% of the mutations could 
be characterized for the p53 exon 8 sequence of approximately 

3Q 100 bases using a 256 primer array of 4 base primers. The 
expected digital array generated by the wild type sequence of 
p53 exon 8 is illustrated schematically in FIG. 4A, while 
FIG. 4B illustrates the expected digital array generated by a 
mutant sequence wherein the nucleotide 38C has been converted 

35 to a T. The difference between the two digital matrices is 
show in FIG. 4C, demonstrating that this single nucleotide 
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polymorphisms generates a unique digital pattern which 
enables it to be distinguished from the wild type sequence by 
the PSA methods of the invention. FIGS. 5A-C show similar 
results for the single nucleotide polymorphism wherein the 
5 nucleotide 38C is transverted to an A, demonstrating that 
this sequence also generates an unique digital pattern 
enabling it to be characterized by the PSA methods of the 
invention. 

FIGS. 6A-C illustrate the results of the experiment for 
10 a p53 exon 8 template sequence having a heterozygous 

polymorphism. The heterozygous template contains two alleles 
of the p53 exon 8 nucleotide sequence. The first allele 
comprises the wild type p53 exon 8 sequence, while the second 
allele comprises the SNP described for FIG. 4B, above; i.e., 
15 the SNP wherein the base 38C is converted to a T. This 

heterozygous template produces a unique digital matrix which 
can be readily distinguished from the digital signal produced 
by PSA analysis of the wild type sequence, as illustrated in 
FIG. 6C which shows the difference between the two signals. 
20 Further, as evidenced by a visual inspection of the two 
figures, the signal generated by the heterozygous SNP and 
illustrated in FIG. 6B is also unique from the homozygous 
SNP's PSA signal pattern illustrated in FIG. 4B. 

Expected binary data was also generated for PSA analysis 
25 of p53 exon 8 sequences having base deletions, insertions, or 
heterozygous mutations. These results are illustrated in 
FIGS. 7A-8C. Specifically, FIG. 7B illustrates the binary 
signal pattern predicted for PSA analysis of a p53 exon 8 
template nucleic acid molecule having a five base deletion. 
3 0 The predicted signal for the wild type p53 exon 8 sequence is 
illustrated in FIG. 7A, and their difference is shown in FIG. 
7C. Likewise, FIG. 8B illustrates the binary signal pattern 
predicted for PSA analysis of a p53 exon 8 template 
containing a five base insertion, and FIG. 8C compares this 
35 signal with that of the wild type p53 exon 8 sequence, 
illustrated in FIG. 8A. The results show that these 
mutations generate signal which enable them to also be 
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uniquely characterized by the PSA methods of the present 
invention. 

7. REFERENCES CITED 
5 All references cited herein are incorporated herein by 

reference in their entirety and for all purposes to the same 
extent as if each individual publication or patent or patent 
application was specifically and individually indicated to be 
incorporated by reference in its entirety for all purposes. 

10 Many modifications and variations of this invention can 

be made without departing from its spirit and scope, as will 
be apparent to those skilled in the art. The specific 
embodiments described herein are offered by way of example 
only, and the invention is to be limited only by the terms of 

15 the appended claims, along with the full scope of equivalents 
to which such claims are entitled. 
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WHAT IS CLAIMED IS: 

1. A method for analyzing the sequence of a template 
comprising: 

(a) capturing the template; 

(b) scanning the captured template using a primer- 
polymerase complex for regions of complementarity 
to the primer; 

(c) extending the primer by one or more nucleotide 
moieties by means of a template-homology dependent 
extension reaction; and 

(d) detecting the extended primer, 

wherein detection of the extended primer indicates the 
presence of one or more regions of complementarity to the 
primer in the captured template. 

2. The method of Claim 1 wherein the primer comprises a 
polynucleotide of 3 to 7 bases. 

3. A method for analyzing the sequence of a template 
nucleic acid according to the methods of Claim 1, wherein the 
steps of the method are repeated for an array of primer- 
polymerase complexes so that a pattern of signals is 
generated for the template. 

4. The method of Claim 3 wherein the array is an array of 
sequence reagents, each sequence reagent comprising: 

(i) a capture moiety; 

(ii) a spacer moiety; and 

(iii) a primer region. 

5. The method of Claim 4 wherein the sequence reagents are 
immobilized to a solid surface. 

6. The method of Claim 5 wherein the solid surface is glass 
or plastic. 
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7. The method of Claim 5 wherein the solid surface is a 
glass plate, a quartz wafer, a nylon membrane, a 
nitrocellulose membrane, or a silicon wafer. 

5 8. The method of Claim 5 wherein the solid surface is 
silicon class. 

9. The method of Claim 5 wherein the solid surface is 
polystyrene plastic. 

10 

10. The method of Claim 4 wherein the sequence reagent 
further comprises an attachment moiety. 

11. The method of Claim 10 wherein the attachment moiety is 
15 located at or near the 5 1 -terminus of the sequence reagent. 

12 . The method of Claim 10 wherein the attachment moiety is 
an amino group, a thiol group, a disulfide group, or a biotin 
group . 

20 

13 . The method of Claim 4 wherein the capture moiety is on a 
first reagent and the primer region is on a second reagent. 

14. The method of Claim 13 wherein the first reagent is 
25 proximal to the second reagent on a solid phase. 

15 . The method of Claim 4 wherein the capture moiety 
comprises a sequence of 8-24 cytosine bases. 

30 16. The method of Claim 4 wherein the capture moiety 

comprises a specific sequence complementary to a PCR primer 
or a portion thereof. 

17. The method of Claim 4 wherein the spacer region is at 
35 least 10 A in length. 
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18. The method of Claim 4 wherein the spacer region 
comprises a random, pseudo-random, or non-random sequence of 
nucleotide bases or analogs thereto. 



5 19. The method of Claim 1 wherein the nucleotide moieties 
are non-chain terminating nucleotides or nucleotide 
analogues . 

20. The method of Claim 19 wherein the nucleotide moieties 
10 are deoxynucleoside triphosphate bases or ribonucleoside 

triphosphate bases. 

21. The method of Claim 1 wherein the nucleotide moiety is a 
chain terminating nucleotide analogue. 

15 

22. The method of Claim 21 wherein the chain terminating 
nucleotide analogue is a dideoxynucleotide. 

23. The method of Claim 1 wherein the nucleotide moiety is 
20 detectably labeled. 

24. The method of Claim 23 wherein the detectable label is a 
fluorescent label. 

25 25. The method of Claim 23 wherein the detectable label is a 
radioactive isotope. 

26. The method of Claim 2 3 wherein the detectable label is 
an electron rich molecule. 

30 

27. The method of Claim 1 wherein the extended primer is 
detected by change in mass. 

28. The method of Claim 4 wherein the density of sequence 
35 reagents in the array is at least 1000 elements/cm 2 . 
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10 



29. A sequence array comprising one or more sequence 
reagents in an orderly arrangement wherein each reagent 
comprises : 

(i) a capture moiety which can form a stable 
complex with a region of a template nucleic 
acid molecule; 

(ii) a spacer region; and 

(iii) a primer region, wherein said primer region 
comprises 3-7 bases. 

30. The sequence array of Claim 29 wherein the array 
comprises a set, subset, or combination of 4 3 - 4 7 different 
sequence reagents. 



15 



20 



25 



30 



35 
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ABSTRACT 

The present invention relates to the field of nucleic 
acid sequence analysis. The invention provides methods for 
performing a polymerase signal assay (PSA) to analyze 
5 nucleotide sequences using solid phase sequence arrays 
comprising a plurality of sequence reagents with primer 
sequences 4-6 bases in length. The methods of the invention 
generate a binary signal pattern which can be used to 
identify nucleic acid sequences and/or mutations and 
10 polymorphisms of a nucleic acid sequence. Mutations and 
polymorphisms which can be identified by the methods of the 
invention include single nucleotide polymorphisms (SNP's), 
base deletions, base insertions, and heterozygous as well 
homozygous polymorphisms. 



20 



25 



30 



35 
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DECLARATION 
AND POWER OF ATTORNEY 

As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below at 201 et scq. underneath my name. 

I believe I am the original, first and sole inventor if only one name is listed at 201 below, or an original, first and joint inventor if plural names 
arc listed at 201 et seq. below, of the subject matter which is claimed and for which a patent is sought on the invention entitled 

POLYMERASE SIGNALING ASSAY 



(if applicable) 

_____ __ (far declaration not accompanying application) 



and was amended under PCT Article 19 on 



and for which a patent application: 

IX is attached hereto and includes amendment(s) filed on 

□ was filed in the United States on as Application No. 

with amendment(s) filed on y appUcabk) 

□ was filed as PCT international Application No. 

(if applicable) 

1 hereby state that 1 have reviewed and understand the contents of the above identified application, including the claims, as amended by any 
amendment referred to above. 

I acknowledge the duty to disclose information known to me to be material to patentability as defined in Title 37, Code of Federal Regulations, 
§1.56. 

I hereby claim foreign priority benefits under Title 35, United States Code, §1 19(a)-(d) of any foreign application(s) for patent or inventor's 
certificate listed below and have also identified below any foreign application for patent or inventor's certificate having a filing date before that 
of the application on which priority is claimed: 



EARLIEST FOREIGN APPLICATION(S), IF ANY, FILED PRIOR TO THE FILING DATE OF THE APPLICATION 


APPLICATION NUMBER 


COUNTRY 


DATE OF FILING 
(day, month, year) 


PRIORITY 
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I hereby claim the benefit under Title 35, United States Code, §119(e) of any United States provisional application(s) listed below. 
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FILING DATE 











I hereby claim the benefit under Title 35, United States Code, §120 of any United States application(s) listed below and, insofar as the subject 
matter of each of the claims of this application is not disclosed in the prior United States application in the manner provided by the first 
paragraph of Title 35, United States Code §1 12, I acknowledge the duty to disclose information which is material to patentability as defined 
in Title 37, Code of Federal Regulations, § 1 .56 which became available between the filing date of the prior application and the national or PCT 
international filing date of this application: 
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ABANDONED 























POWER OF ATTORNEY: As a named inventor. I hereby appoint S. Leslie Misrock (Reg. No. 18872), Harry C. Jones, m (Reg. No. 202S0), Berj A Terzian 
(Reg. No. 20060), Gerald J. Flinloft (Reg. No. 20823), David Weild, III (Reg. No. 21094), Jonathan A. Marshall (Reg. No. 24614), Barry D. Rein (Reg No 
2241 1), Stanton T. Lawrence, in (Reg. No. 25736), Isaac Jarkovsky (Reg. No. 22713), Joseph V. Colaianni (Reg. No. 20019), Charles E. McKenney (Reg 
No. 22795), Philip T. Shannon (Reg. No. 24278), Francis E. Morris (Reg. No. 24615), Charles E. Miller (Reg. No. 24576), Gidon D. Stern (Reg No 27469) 
John J. Lamer, Jr. (Reg. No. 27S14), Brian M. Poissant (Reg. No. 28462), Brian D. Coggio (Reg. No. 27624), Rory J. Radding (Reg. No. 28749) Stephen 
J. Harbulak (Reg. No. 29166), Donald J. Goodell (Reg. No. 19766). James N. Palik (Reg. No. 25510), Thomas E. Friebel (Reg. No. 29258), Laura A Coruzzi 
(Reg. No. 30742), Jennifer Gordon (Reg. No. 30753), Jon R. Stark (Reg. No. 3011 1), Allan A. Fanucci (Reg. No. 30256), Geraldine F. Baldwin (Reg No 
31232), Victor N. Balancia (Reg. No. 31231). Samuel B. Abrams (Reg. No. 30605), Steven I. Wailach (Reg. No. 35402), Marcia H. Sundeen(Reg No 30893) 
Paul J. Zegger (Reg. No. 33821), Edmond R. Bannon (Reg. No. 32110), Bruce J. Barker (Reg. No. 33291), Adriane M. Antler (Reg. No. 32605), Thomas 
G. Rowan (Reg. No. 34419), Ann L. Gisolfi (Reg. No. 31956), Mark A. Farley (Reg. No. 33170). and James G. Markey (Reg. No. 31636), all of Pennie & 
Edmonds llp, whose addresses are 1 1 55 Avenue of the Americas, New York, New York 10036. 1667 K Street N.W., Washington, DC 20006 and 3300 Hillview 
Avenue. Palo Alto, CA 94304. and each of them, my attorneys, to prosecute this application, and to transact all business in the Patent and Trademark Office 
connected therewith. 
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